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Title: 

Description : 
Perfect Score: 
N.A. Sequence: 
Comp : 

Scoring table: 



>US-08-770-564A-l 

(1-981) from US08770564A. seq 

981 

1 CTGCAGAGGATAGAAAAAAG ACTTAGTTCCTGCTCTGCAG 981 

GACGTCTCCTATCTTTTTTC TGAATCAAGGACGAGACGTC 

TABLE default 
Gap 6 
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Database: u-embl51_101 

142:partl 143:part2 

Statistics: Mean 10.906; Variance 4.540; scale 2.402 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 



Result 
No. 


Score 


Query 
Match 


Length 


DB 


ID 


Description 


Pred. No. 




1 


933 


95. 


, 1 


2420 


72 


131750 


Sequence 3 from paten 


0 . 00e+00 




2 


670 


68. 


,3 


962 


142 


S79400 


hTR=RNA component of 


0.00e+00 




3 


596 


60. 


,8 


598 


96 


HSU85256 


Human telomerase RNA, 


0 . 00e+00 




4 


514 


52. 


, 4 


554 


i? 


T^l 7 A Q 


Sequence 1 frc-m paten 






5 


451 


46. 


,0 


451 


96 


HSU86046 


Human telomerase RNA 


0 . 00e+00 




6 


65 


6. 


, 6 


560 


105 


MMU33831 


Mus musculus telomera 


1.01e-32 




7 


38 


3. 


, 9 


215 


71 


128278 


Sequence 5 from paten 


2.28e-10 


c 


8 


35 


3. 


, 6 


215 


71 


128278 


Sequence 5 from paten 


3.99e-08 


c 


9 


30 


3. 


, 1 


30 


72 


131769 


Sequence 22 from pate 


1.45e-04 


c 


10 


26 


2. 


, 7 


26 


72 


131770 


Sequence 23 from pate 


6. 35e-02 


c 


11 


26 


2. 


, 7 


54404 


5 


HS366D1 


Human DNA sequence ** 


6.35e-02 


c 


12 


26 


2. 


, 7 


54404 


128 


HS366D1 


Human DNA sequence ** 


6.35e-02 


c 


13 


25 


2 . 


, 5 


565 


68 


E04076 


gDNA encoding envelop 


2. 68e-01 




14 


25 


2 . 


.5 


1658 


97 


HUMA2PIG1 


Human alpha-2-plasmin 


2. 68e-01 


c 


15 


25 


2. 


.5 


3510 


32 


PSESDSAB11 


Pseudomonas sp . (stra 


2. 68e-01 


c 


16 


25 


2. 


,5 


3720 


102 


S57132 


COL16Al=type XVI coll 


2.68e-01 


c 


17 


25 


2. 


,5 


5387 


97 


HUMCOL16A 


Homo sapiens alpha-1 


2. 68e-01 




18 


24 


2 . 


. 4 


201 


66 


A10158 


S.griseus gene for pr 


1. 09e+00 




19 


24 


2. 


.4 


201 


66 


A10161 


Synthetic DNA for pre 


1. 09e+00 


c 


20 


24 


2, 


.4 


201 


66 


A10162 


Synthetic DNA for pre 


1.09e+00 


c 


21 


24 


2 . 


.4 


201 


66 


A10159 


S . griseus gene for pr 


1.09e+00 


c 


22 


24 


2. 


.4 


1157 


122 


HSEIR5A 


Equine herpesvirus ty 


1 .09e+00 




23 


24 


2. 


,4 


1663 


60 


MVU92534 


Mustela vison microsa 


1.09e+00 




24 


24 


2. 


.4 


1877 


108 


MUSSEQUENC 


Mus musculus HIC-5 mR 


1.09e+00 


c 


25 


24 


2 , 


. 4 


4385 


122 


HSES1RO 


Equine herpes virus t 


1.09e+00 




26 


24 


2, 


.4 


4605 


100 


HUMPREP 


Human gene for prepro 


1.09e+00 




27 


24 


2. 


.4 


4606 


102 


S73906 


adrenomedullin=potent 


1.09e+00 


c 


28 


24 


2, 


.4 


4928 


116 


EHV1SGNS 


Equine herpesvirus ty 


1.09e+00 


c 


29 


24 


2, 


.4 


38793 


18 


MTCY270 


Mycobacterium tubercu 


1.09e+00 


c 


30 


24 


2, 


.4 


38793 


31 


MTCY270 


Mycobacterium tubercu 


1.09e+00 


c 


31 


24 


2, 


.4 


95257 


45 


HS999D10 


Human DNA sequence ** 


1.09e+00 




32 


24 


2, 


.4 


139887 


96 


HUB384D8 


Chromosome 22ql3 BAC 


1.09e+00 


c 


33 


24 


2, 


.4 


144406 


44 


HS104C13 


Human DNA sequence ** 


1.09e+00 




34 


24 


2, 


.4 


150223 


122 


HSECOMGEN 


Equine herpesvirus 1 


1.09e+00 


c 


35 


24 


2, 


.4 


150223 


122 


HSECOMGEN 


Equine herpesvirus 1 


1.09e+00 




36 


23 


2. 


.3 


69 


72 


141362 


Sequence 142 from pat 


4.24e+00 


c 


37 


23 


2, 


.3 


320 


87 


HS152D3R 


H. sapiens CpG DNA, cl 


4.24e+00 




38 


23 


2. 


.3 


565 


68 


E04076 


gDNA encoding envelop 


4.24e+00 




39 


23 


2, 


.3 


1595 


104 


MMNCAM1B 


Mouse mRNA for neural 


4 .24e+00 


c 


40 


23 


2, 


.3 


1966 


106 


MUSCALCHAN 


Mus musculus L-type c 


4.24e+00 


c 


41 


23 


2, 


.3 


2500 


76 


D86598 


Norway spruce mRNA fo 


4.24e+00 




42 


23 


2, 


.3 


4552 


19 


MMNCAMR3 


Mouse mRNA for 3 ' -end 


4 .24e+00 




43 


23 


2, 


.3 


4552 


104 


MMNCAMR3 


Mouse mRNA for 3 '-end 


4.24e+00 




44 


23 


2, 


.3 


33769 


5 


HSAC2100 


Genomic sequence from 


4 .24e+00 




45 


23 


2, 


.3 


33769 


86 


AC002100 


Genomic sequence from 


4.24e+00 



ALIGNMENTS 

RESULT 1 

LOCUS 131750 2420 bp DNA PAT 20-DEC-1996 

DEFINITION Sequence 3 from patent US 5583016. 



ACCESSION 131750 
NID gl822541 
KEYWORDS 

SOURCE Unknown. 
ORGANISM Unknown. 

Unclassified . 
REFERENCE 1 (bases 1 to 2420) 

AUTHORS Villeponteau, B. , Feng, J., Funk,W. and Andrews, W.H. 
TITLE Mammalian telomerase 
JOURNAL Patent: US 5583016-A 3 10-DEC-1996; 
FEATURES Location /Qualifiers 

source 1 . .2420 

/organism= "unknown" 
BASE COUNT 620 a 572 c 647 g 581 t 

ORIGIN 

Query Match 95.1%; Score 933; DB 72; Length 2420; 

Best Local Similarity 99,1%; Pred. . C.OCc+CC; 

Matches 972; Conservative 0; Mismatches 3; Indels 6; Gaps 5 

ctgcagaggatagaaaaaaggccctctgatacctcaagttagtttcacctttaaagaagg 1253 

I I I I I I I I I I I t I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I 
CTGCAGAGGATAGAAAAAAGWCCCTCTGATACCTCAAGTTAGTTTCACCTTTAAAGAAGG 60 

tcggaagtaaagacgcaaagcctttcccggacgtgcggaagggcaacgtccttcctcatg 1313 

I I I i I I I I I I I I I f I I I I I I I I I I I I I I I I I I I I I M I I I I I I I M I I I f I I I I I I I I I I 
TCGGAAGTAAAGACGCAAAGCCTTTCCCGGACGTGCGGAAGGGCAACGTCCTTCCTCATG 120 

gccggaaatggaactttaatttcccgttccccccaaccagcccgcccgagagagtgactc 1373 

I I I I I I I II I t I I I I ! I I I I I I I I I I I I I I I I If II I I I I I I I I I I I I I I I I II I I I I I I 
GCCGGAAATGGAACTTTAATTTCCCGTTCCCCCCAACCAGCCCGCCCGAGAGAGTGACTC 180 

tcacgagagccgcgagagtcagcttggccaatccgtgcggtcggcggccgctccctttat 1433 

I I I I I I I I I I I I I I I I I t I I I I I I I I I I 1 I I I 1 I I II I I I I I I I I I I I I I I M I I I I I I I 
TCACGAGAGCCGCGAGAGTCAGCTTGGCCAATCCGTGCGGTCGGCGGCCGCTCCCTTTAT 240 

aagccgactcgcccggcagcgcaccgggttgcggagggtgggcctgggaggggtggtggc 14 93 

I I II I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I M I I I I I I II I I I I I I 
AAGCCGACTCGCCCGGCAGCGCACCGGGTTGCGGAGGGTGGGCCTGGGAGGGGTGGTGGC 300 

cattttttgtctaaccctaactgagaagggcgtaggcgccgtgcttttgctccccgcgcg 1553 

I I I I I I II I I I I I I I I I I I I I I I I I I I M I I I II I I I I I I I II I I f I I I i I M I II I M I 
CATTTTTTGTCTAACCCTAACTGAGAAGGGCGTAGGCGCCGTGCTTTTGCTCCCCGCGCG 360 

ctgtttttctcgctgactttcagcgggcggaaaagcctcggcctgccgccttccaccgtt 1613 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I II I I I I I I I I I I I I I I 
CTGTTTTTCTCGCTGACTTTCAGCGGGCGGAAAAGCCTCGGCCTGCCGCCTTCCACCGTT 420 

cattctagagcaaacaaaaaatgtcagctgctggcccgttcgcc--tcccggggacctgc 1671 

I I I I I 1 I I I I I I I I II i I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I 
CATTCTAGAGCAAACAAAAAATGTCAGCTGCTGGCCCGTTCGCCCCTCCCGGGGACCTGC 480 

ggcgggtcgcctgcccagcccccgaaccccgcctggag-ccgcggtcggcccggggcttc 1730 

I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I 1 I I I I I I I I I I I 
GGCGGGTCGCCTGCCCAGCCCCCGAACCCCGCCTGGAGGCCGCGGTCGGCCCGGGGCTTC 540 

tccggaggcacccactgccaccgcgaagagttgggctctgtcagccgcgggtctctcggg 1790 

I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I M I I I I I I I I I I I I I I I I I 
TCCGGAGGCACCCACTGCCACCGCGAAGAGTTGGGCTCTGTCAGCCGCGGGTCTCTCGGG 600 

ggcgagggcgaggttcac-cgtttcaggccgcaggaagaggaacggagcgagtccc-gc- 1847 

I I I I I I I I I I I I I I I t I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I f I I I I I II 
GGCGAGGGCGAGGTTCAGGCCTTTCAGGCCGCAGGAAGAGGAACGGAGCGAGTCCCCGCG 660 

cgcggcgcgattccctgagctgtgggacgtgcacccaggactcggctcacacatgcagtt 1907 
I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I II I I I I I I I I I 1 I 



Db 


1194 


Qy 


1 


Db 


1254 


Qy 


61 


Db 


1314 


Qy 


121 


Db 


1374 


Qy 


181 


Db 


1434 


Qy 


241 


Db 


1494 


Qy 


301 


Db 


1554 


Qy 


361 


Db 


1614 


Qy 


421 


Db 


1672 


Qy 


481 


Db 


1731 


Qy 


541 


Db 


1791 


Qy 


601 


Db 


1848 



3 



Qy 661 CGCGGCGCGATTCCCTGAGCTGTGGGACGTGCACCCAGGACTCGGCTCACACATGCAGTT 720 

Db 1908 cgctttcctgttggtggggggaacgccgatcgtgcgcatccgtcacccctcgccggcagt 1967 

I I ! I I I I I I I I I I I M I I I I I I I I I I I II I I I I M I I I I I I I I I I M I I I M I I I ! I I I 1 
Qy 721 CGCTTTCCTGTTGGTGGGGGGAACGCCGATCGTGCGCATCCGTCACCCCTCGCCGGCAGT 780 

Db 1968 gggggcttgtgaacccccaaacctgactgactgggccagtgtgctgcaaattggcaggag 2027 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I 
Qy 781 GGGGGCTTGTGAACCCCCAAACCTGACTGACTGGGCCAGTGTGCTGCAAATTGGCAGGAG 840 

Db 2028 acgtgaaggcacctccaaagtcggccaaaatgaatgggcagtgagccggggttgcctgga 2087 

M f f I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I M 
Qy 841 ACGTGAAGGCACCTCCAAAGTCGGCCAAAATGAATGGGCAGTGAGCCGGGGTTGCCTGGA 900 

Db 2088 gccgttcctgcgtgggttctcccgtcttccgctttttgttgccttttatggttgtattac 2147 

I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I t I I I I I 1 If I I I I I I I It t I I 
Qy 901 GCCGTTCCTGCGTGGGTTCTCCCGTCTTCCGCTTTTTGTTGCCTTTTATGGTTGTATTAC 960 

Db 2148 aacttagttcctgctctgcag 2168 

I I II I I I I I I I! I I I I I I I I I 
Qy 961 AACTTAGTTCCTGCTCTGCAG 981 

RESULT 2 

ID S79400 backbone; DNA; HUM; 962 BP. 

AC S79400; 

NI gl086943 

DT 04-DEC-1995 (Rel. 45, Created) 

DT 30-JAN-1997 (Rel. 50, Last updated, Version 3) 

DE hTR=RNA component of telomerase [human, 293 cells, Genomic, 962 

DE nt] . 

KW 

OS Homo sapiens (human) 

OC Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; 

OC Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

RN [1] 

RP 1-962 

RX MEDLINE; 95381057. 

RA Feng J., Funk W.D., Wang S.S., Weinrich S.L., Avilion A. A., 

RA Chiu CP., Adams R.R., Chang E., Allsopp R.C., Yu J., Le S., 

RA West M.D., Harley C.B., Andrews W.H., Greider C.W., 

RA Villeponteau B.; 

RT "The RNA component of human telomerase"; 

RL Science 269:1236-1241(1995). 

CC NCBI gi: 1086943 

FH Key Location/Qualifiers 
FH 

FT source 1 . . 962 

FT /organism="Homo sapiens" 

FT /note= "human" 

FT misc_RNA 1..962 
FT /partial 
FT /gene="hTR" 

FT /note="telomerase RNA; Description: RNA component of 

FT telomerase" 

SQ Sequence 962 BP; 178 A; 257 C; 288 G; 239 T; 0 other; 

Query Match 68.3%; Score 670; DB 142; Length 962; 

Best Local Similarity 98.9%; Pred. No. 0.00e+00; 

Matches 708; Conservative 0; Mismatches 2; Indels 6; Gaps 5; 

Db 1 gggttgcggagggtgggcctgggaggggtggtggccattttttgtctaaccctaactgag 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Qy 266 GGGTTGCGGAGGGTGGGCCTGGGAGGGGTGGTGGCCATTTTTTGTCTAACCCTAACTGAG 325 

Db 61 aagggcgtaggcgccgtgcttttgctccccgcgcgctgtttttctcgctgactttcagcg 120 



4 



1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 i 1 1 1 



Qy 


326 


AAGGGCGTAGGCGCCGTGCTTTTGCTCCCCGCGCGCTGTTTTTCTCGCTGACTTTCAGCG 


385 


Db 


121 


ggcggaaaagcctcggcctgccgccttccaccgttcattctagagcaaacaaaaaatgtc 


180 






II ) 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 i 1 M 1 




Qy 


386 


GGCGGAAAAGCCTCGGCCTGCCGCCTTCCACCGTTCATTCTAGAGCAAACAAAAAATGTC 


445 


Db 


181 


agctgctggcccgttcgcc — tcccggggacctgcggcgggtcgcctgcccagcccccga 


238 






1 1 1 1 1 1 1 i 1 1 i 1 1 1 1 1 1 1 1 1 1 1 I I 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 II 1 II 1 1 1 1 1 1 1 1 1 1 1 




Qy 


446 


AGCTGCTGGCCCGTTCGCCCCTCCCGGGGACCTGCGGCGGGTCGCCTGCCCAGCCCCCGA 


505 


Db 


239 


accccgcctggag-ccgcggtcggcccggggcttctccggaggcacccactgccaccgcg 


297 






II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 II 1 II M 1 M 1 1 1 1 1 1 1 1 1 1 1 1 




Qy 


506 


ACCCCGCCTGGAGGCCGCGGTCGGCCCGGGGCTTCTCCGGAGGCACCCACTGCCACCGCG 


565 


Db 


298 


aagagttgggctctgtcagccgcgggtctctcgggggcgagggcgaggttcac-cgtttc 


356 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1! 1 1 1 1 1 1 1 




Qy 


566 


AAGAGTTGGGCTCTGTCAGCCGCGGGTCTCTCGGGGGCGAGGGCGAGGTTCAGGCCTTTC 




Db 


357 


aggccgcaggaagaggaacggagcgagtccc-gc-cgcggcgcgattccctgagctgtgg 


414 






1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 i 1 t 1 1 1 1 1 1 II 1 1 1 1 1 




Qy 


626 


AGGCCGCAGGAAGAGGAACGGAGCGAGTCCCCGCGCGCGGCGCGATTCCCTGAGCTGTGG 


685 


Db 


415 


gacgtgcacccaggactcggctcacacatgcagttcgctttcctgttggtggggggaacg 


474 






1 1 1 1 1 1 1 I 1 1 M 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 f 1 1 1 I 1 1 1 1 1 II 1 1 1 1 1 I 1 1 1 1 M t 1 1 1 1 




Qy 


686 


GACGTGCACCCAGGACTCGGCTCACACATGCAGTTCGCTTTCCTGTTGGTGGGGGGAACG 


745 


Db 


475 


ccgatcgtgcgcatccgtcacccctcgccggcagtgggggcttgtgaacccccaaacctg 


534 






1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 M 1 1 1 1 1 1 1 M 1 1 




Qy 


746 


CCGATCGTGCGCATCCGTCACCCCTCGCCGGCAGTGGGGGCTTGTGAACCCCCAAACCTG 


805 


Db 


535 


actgactgggccagtgtgctgcaaattggcaggagacgtgaaggcacctccaaagtcggc 


594 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 




Qy 


806 


ACTGACTGGGCCAGTGTGCTGCAAATTGGCAGGAGACGTGAAGGCACCTCCAAAGTCGGC 


865 


Db 


595 


caaaatgaatgggcagtgagccggggttgcctggagccgttcctgcgtgggttctcccgt 


654 






1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 II M 1 1 1 1 1 1 1 1 1 1 1 




Qy 


866 


CAAAATGAATGGGCAGTGAGCCGGGGTTGCCTGGAGCCGTTCCTGCGTGGGTTCTCCCGT 


925 


Db 


655 


cttccgctttttgttgccttttatggttgtattacaacttagttcctgctctgcag 710 








1 M 1 1 1 1 II 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Qy 


926 


CTTCCGCTTTTTGTTGCCTTTTATGGTTGTATTACAACTTAGTTCCTGCTCTGCAG 981 





RESULT 3 

LOCUS HSU85256 598 bp DNA PRI ll-APR-1997 

DEFINITION Human telomerase RNA, partial sequence. 

ACCESSION U85256 

NID gl932797 

KEYWORDS 

SOURCE human . 

ORGANISM Homo sapiens 

Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; 
• Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; 
Homo . 

REFERENCE 1 (bases 1 to 598) 

AUTHORS Bryan, T.M., Marusic,L., Bacchetti , S . , Namba,M. and Reddel,R.R. 

TITLE The telomere lengthening mechanism in telomerase-negative immortal 
human cells does not involve the telomerase RNA subunit 

JOURNAL Hum. Mol . Genet. (1997) In press 
REFERENCE 2 (bases 1 to 598) 

AUTHORS Bryan, T . M . and Reddel,R.R. 

TITLE Direct Submission 

JOURNAL Submitted ( 13- JAN-1997 ) Children's Medical Research Institute, 214 
Hawkesbury Rd., Westmead, NSW 2145, Australia 
REFERENCE 3 (bases 1 to 598) 



5 



AUTHORS 

TITLE 

JOURNAL 

REMARK 
FEATURES 

source 



misc RNA 



BASE COUNT 
ORIGIN 



Bryan, T.M. and Reddel,R.R. 
Direct Submission 

Submitted ( 10-APR-1997) Children's Medical Research Institute, 214 
Hawkesbury Rd., Westmead, NSW 2145, Australia 
Sequence update by submitter 

Location/Qualifiers 

1. .598 

/organism="Homo sapiens" 

/cell_line=" four immortal human cell lines (293, SUSM-1, 
GM847, GM639)" 
/chromosome="3q" 
1. .>598 

/note="hTR; RNA component of telomerase; four sequence 
differences found as compared to the sequence of earlier 
submission of GenBank Accession Number S79400" 
/product="telomerase RNA" 
93 a 191 c 203 g 111 t 



Query Match 
Best Local Similarity 

597 ; Conservative 



Matches 


Db 


i 


Qy 


266 


Db 


61 


Qy 


326 


Db 


121 


Qy 


386 


Db 


181 


Qy 


446 


Db 


241 


Qy 


506 


Db 


301 


Qy 


566 


Db 


361 


Qy 


626 


Db 


421 


Qy 


686 


Db 


481 


Qy 


746 


Db 


541 


Qy 


806 



60.8%; Score 596; DB 96; Length 598; 
99.8%; Pred. No. 0.00e+00; 

0; Mismatches 1; Indels 0; 



Gaps 



gggttgcggagggtgggcctgggaggggtggtggccattttttgtctaaccctaactgag 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I IJ I I I I I I I I I I I I I I I i I I 
GGGTTGCGGAGGGTGGGCCTGGGAGGGGTGGTGGCCATTTTTTGTCTAACCCTAACTGAG 325 



aagggcgtaggcgccgtgcttttgctccccgcgcgctgtttttctcgctgactttcagcg 

I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I II I I I I I I I I I I 
AAGGGCGTAGGCGCCGTGCTTTTGCTCCCCGCGCGCTGTTTTTCTCGCTGACTTTCAGCG 

ggcggaaaagcctcggcctgccgccttccaccgttcattctagagcaaacaaaaaatgtc 
I I I I I I I I I I I I ! I I I I I I M I I I I I I I I I I I I I I I I I I I I I i I I I I I I M I I I I I I I I I 
GGCGGAAAAGCCTCGGCCTGCCGCCTTCCACCGTTCATTCTAGAGCAAACAAAAAATGTC 

agctgctggcccgttcgcccctcccggggacctgcggcgggtcgcctgcccagcccccga 

M I I t I t M I I t I I I I I I I 1 I M I I I I II I I I I I II I I I I M I II 1 I I M I I I I I I II I I 
AGCTGCTGGCCCGTTCGCCCCTCCCGGGGACCTGCGGCGGGTCGCCTGCCCAGCCCCCGA 

accccgcctggaggccgcggtcggcccggggcttctccggaggcacccactgccaccgcg 

I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I 1 I I I I I I I I I I I M 
ACCCCGCCTGGAGGCCGCGGTCGGCCCGGGGCTTCTCCGGAGGCACCCACTGCCACCGCG 

aagagttgggctctgtcagccgcgggtctctcgggggcgagggcgaggttcaggcctttc 

I I I I I I I I I I It I I I I I I I I I I I I I I I I I I I I M I I I I M I I I I I II I M I I I I I I M I I 
AAGAGTTGGGCTCTGTCAGCCGCGGGTCTCTCGGGGGCGAGGGCGAGGTTCAGGCCTTTC 

aggccgcaggaagaggaacggagcgagtccccgcgcgcggcgcgattccctgagctgtgg 
I I I I M II I I I I I I I II I I I I I ! I I I I I I I I I I I I M M I I I M I I I I I I I I I I I I I I I I 
AGGCCGCAGGAAGAGGAACGGAGCGAGTCCCCGCGCGCGGCGCGATTCCCTGAGCTGTGG 



120 



385 



180 



445 



240 



505 



300 



565 



360 



625 



420 



685 



480 



gacgtgcacccaggactcggctcacacatgcagttcgctttcctgttggtggggggaacg 
I 1 I I I I I M I I ! I I 1 M I I I I I M I I I I I I I I I 11 I I I I I I I I I I i I I I I I I I I M I M I 
GACGTGCACCCAGGACTCGGCTCACACATGCAGTTCGCTTTCCTGTTGGTGGGGGGAACG 7 45 

ccgatcgtgcgcatccgtcacccctcgccggcaatgggggcttgtgaacccccaaacctg 540 

I I M I I I I I I 1 i I I I I I I I I I I I I I I I M I M I I i I I I I I I I I I I I I I I I II I i I I I I I 
CCGATCGTGCGCATCCGTCACCCCTCGCCGGCAGTGGGGGCTTGTGAACCCCCAAACCTG 805 

actgactgggccagtgtgctgcaaattggcaggagacgtgaaggcacctccaaagtcg 598 

M I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I II I I I I I I I I I 
ACTGACTGGGCCAGTGTGCTGCAAATTGGCAGGAGACGTGAAGGCACCTCCAAAGTCG 863 



RESULT 
LOCUS 



131748 



554 bp 



DNA 



PAT 



20-DEC-1996 



6 



DEFINITION 

ACCESSION 

NID 

KEYWORDS 
SOURCE 

ORGANISM 

REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

FEATURES 

source 

BASE COUNT 
ORIGIN 



Sequence 

131748 

gl822539 



1 from patent US 5583016. 



Unknown . 
Unknown . 
Unclassified. 
1 (bases 1 to 554) 
Villeponteau, B. , Feng, J. 
Mammalian telomerase 
Patent: US 5583016-A 1 10-DEC-1996 

Location/Qualifiers 

1. .554 

/organism= "unknown" 
80 a 179 c 189 g 



Funk,W. and Andrews, W.H. 



106 t 



Query Match 52.4%; 
Best Local Similarity 98.6%; 

552; Conservative 



Matches 


Db 


l 


Qy 


266 


Db 


61 


Qy 


326 


Db 


121 


Qy 


386 


Db 


181 


Qy 


446 


Db 


239 


Qy 


506 


Db 


298 


Qy 


566 


Db 


357 


Qy 


626 


Db 


415 


Qy 


686 


Db 


475 


Qy 


746 


Db 


535 


Qy 


806 



Score 514: DB 72; length 554; 
Pred. No. 0.00e+00; 
0; Mismatches 2; Indels 



6; Gaps 5; 



gggttgcggagggtgggcctgggaggggtggtggccattttttgtctaaccctaactgag 60 

I I I I I I I I I I I I I I I t I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I 
GGGTTGCGGAGGGTGGGCCTGGGAGGGGTGGTGGCCATTTTTTGTCTAACCCTAACTGAG 325 

aagggcgtaggcgccgtgcttttgctccccgcgcgctgtttttctcgctgactttcagcg 120 

II I I t I I t I I I I I I I I I I I I I I I I I I I I I I ! I I ! I I I M I t I I I I I I I I I I I I I I M M I 
AAGGGCGTAGGCGCCGTGCTTTTGCTCCCCGCGCGCTGTTTTTCTCGCTGACTTTCAGCG 385 

ggcggaaaagcctcggcctgccgccttccaccgttcattctagagcaaacaaaaaatgtc 180 

I I I I I I I I M I I I I I I i I I I I I M I I I I I I I I M II I I M I M I I i I I I I I I I I I I I I I I 
GGCGGAAAAGCCTCGGCCTGCCGCCTTCCACCGTTCATTCTAGAGCAAACAAAAAATGTC 445 



agctgctggcccgttcgcc — tcccggggacctgcggcgggtcgcctgcccagcccccga 
I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I 1 I I I I I I I I I ! I I I I I 
AGCTGCTGGCCCGTTCGCCCCTCCCGGGGACCTGCGGCGGGTCGCCTGCCCAGCCCCCGA 



238 



505 



297 



accccgcctggag-ccgcggtcggcccggggcttctccggaggcacccactgccaccgcg 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I 

ACCCCGCCTGGAGGCCGCGGTCGGCCCGGGGCTTCTCCGGAGGCACCCACTGCCACCGCG 565 

aagagttgggctctgtcagccgcgggtctctcgggggcgagggcgaggttcac-cgtttc 356 

I I I I I I I I I I ! I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I M II I I I I I I I I I 
AAGAGTTGGGCTCTGTCAGCCGCGGGTCTCTCGGGGGCGAGGGCGAGGTTCAGGCCTTTC 625 

aggccgcaggaagaggaacggagcgagtccc-gc-cgcggcgcgattccctgagctgtgg 414 

I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I II I I I I M I I I N I M I I I ! I I I i I I I 
AGGCCGCAGGAAGAGGAACGGAGCGAGTCCCCGCGCGCGGCGCGATTCCCTGAGCTGTGG 685 

gacgtgcacccaggactcggctcacacatgcagttcgctttcctgttggtggggggaacg 474 

I I I I I I I I I I I M I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
GACGTGCACCCAGGACTCGGCTCACACATGCAGTTCGCTTTCCTGTTGGTGGGGGGAACG 7 45 

ccgatcgtgcgcatccgtcacccctcgccggcagtgggggcttgtgaacccccaaacctg 534 

I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I t I I I I I i I I I I I I I I I I I I I I I I I I I I 
CCGATCGTGCGCATCCGTCACCCCTCGCCGGCAGTGGGGGCTTGTGAACCCCCAAACCTG 805 

actgactgggccagtgtgct 554 

I I I I I I I I I I I I I I I I ! I I I 
ACTGACTGGGCCAGTGTGCT 825 



RESULT 5 

LOCUS HSU86046 451 bp DNA PRI 27-JAN-1997 

DEFINITION Human telomerase RNA (hTR) gene sequence. 
ACCESSION U86046 S79400 



7 



NID 

KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



TITLE 
JOURNAL 
MEDLINE 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



FEATURES 

source 



gene 



misc RNA 



BASE COUNT 
ORIGIN 



gl800150 
human . 

Homo sapiens 

Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; 
Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

1 (bases 1 to 451) 

Feng, J. , Funk,W.D., Wang, S.-S., Weinrich, S . L. , Avilion, A. A. , 
Chiu,C.-P., Adams, R.R., Chang, E., Allsopp, R. C . , Yu,J., Le,S., 
West,M.D., Harley,C.B., Andrews, W. H. , Greider,C.W. and 
Villeponteau, B. 

The RNA component of human telomerase 
Science 269 (5228), 1236-1241 (1995) 
95381057 

2 (bases 1 to 451) 
Villeponteau, B. and Andrews, W.H. 
Direct Submission 

Submitted { 1 6-AUG-l 99^} Willi?™ H. *r.drev:s, Mclocuior Biology, 
Geron Corporation, 200 Constitution Drive, Menlo Park, CA 94025, 
USA 

Location/Qualifiers 
1. .451 

/organism="Homo sapiens" 
1. .451 

/note="Description : human telomerase RNA" 
/gene="hTR" 
1. .451 
/gene="hTR" 

/note="Description : human telomerase RNA" 
65 a 148 c 156 g 82 t 



Query Match 



4 6.0%; 



Best Local Similarity 100.0%; 

451; Conservative 



Matches 


Db 


1 


Qy 


266 


Db 


61 


Qy 


326 


Db 


121 


Qy 


386 


Db 


181 


Qy 


446 


Db 


241 


Qy 


506 


Db 


301 


Qy 


566 


Db 


361 


Qy 


626 


Db 


421 



Score 451; DB 96; Length 451; 
Pred. No. 0.00e+00; 
0; Mismatches 0; Indels 0; 



Gaps 



gggttgcggagggtgggcctgggaggggtggtggccattttttgtctaaccctaactgag 60 

I I I i I I I I I I I I I II I I I I M I I I I I I I I I I I I I I ! I I I I I I I I I I I I I 1 I I I I I I I ! I I 
GGGTTGCGGAGGGTGGGCCTGGGAGGGGTGGTGGCCATTTTTTGTCTAACCCTAACTGAG 325 

aagggcgtaggcgccgtgcttttgctccccgcgcgctgtttttctcgctgactttcagcg 120 

I I I I I II I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I II I I I I I I t I I I I I I t I I I M II I 
AAGGGCGTAGGCGCCGTGCTTTTGCTCCCCGCGCGCTGTTTTTCTCGCTGACTTTCAGCG 385 

ggcggaaaagcctcggcctgccgccttccaccgttcattctagagcaaacaaaaaatgtc 180 
I I I I I I I! I I I i M I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I M I I I 
GGCGGAAAAGCCTCGGCCTGCCGCCTTCCACCGTTCATTCTAGAGCAAACAAAAAATGTC 445 

agctgctggcccgttcgcccctcccggggacctgcggcgggtcgcctgcccagcccccga 240 

I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
AGCTGCTGGCCCGTTCGCCCCTCCCGGGGACCTGCGGCGGGTCGCCTGCCCAGCCCCCGA 505 

accccgcctggaggccgcggtcggcccggggcttctccggaggcacccactgccaccgcg 300 
I I I I I I I I I I I I I I I (I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 
ACCCCGCCTGGAGGCCGCGGTCGGCCCGGGGCTTCTCCGGAGGCACCCACTGCCACCGCG 565 

aagagttgggctctgtcagccgcgggtctctcgggggcgagggcgaggttcaggcctttc 360 
I I I I I I I I I I I I I I I I I II I I 11 I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I II 
AAGAGTTGGGCTCTGTCAGCCGCGGGTCTCTCGGGGGCGAGGGCGAGGTTCAGGCCTTTC 625 

aggccgcaggaagaggaacggagcgagtccccgcgcgcggcgcgattccctgagctgtgg 420 

I I I I I I I I I I I I I I I I I II I I I I II I I I I I I I I I I I I I ( I I I I I I M I I I I I M I I I I I I 
AGGCCGCAGGAAGAGGAACGGAGCGAGTCCCCGCGCGCGGCGCGATTCCCTGAGCTGTGG 685 



I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I 



8 



Qy 



686 GACGTGCACCCAGGACTCGGCTCACACATGC 716 



6 



RESULT 
LOCUS 

DEFINITION 

ACCESSION 

NID 

KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
MEDLINE 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



FEATURES 

source 



MMU33831 
Mus musculus 
U33831 
g!000197 



560 bp DNA ROD 
telomerase RNA component gene. 



28-SEP-1995 



snRNA 



misc feature 



BASE COUNT 
ORIGIN 



house mouse. 
Mus musculus 

Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; 
Vertebrata; Eutheria; Rodentia; Sciurognathi ; Myomorpha; Muridae; 
Murinae; Mus. 

1 (bases 1 to 560) 

Blasco,M.A., Funk,W., Villeponteau, B . and Greider,C.W. 
Functional characterization and developmental regulation of mouse 

t Q.l nmAT-fl s a RMS 

Science 269 (5228), 1267-1270 (1995) 
95381063 

2 (bases 1 to 560) 

Blacso,M.A., Funk,W., Villeponteau, B . and Greider,C.W. 
Direct Submission 

Submitted ( 12-AUG-1995) Carol W. Greider, Cold Spring Harbor 
Laboratory, PO Box 100, 1 Bungtown Road, Cold spring Harbor, NY 
11724, USA 

Location/Qualifiers 

1. .560 

/organism="Mus musculus" 

/ no te="initially cloned from D3 embryonic stem cell 

library" 

25. .>453 

/note="The RNA is approximately 430 nucleotides long. The 
5 ? end has been determined, but 3' end has not been 
precisely mapped. Based on RT-PCR analysis and sequence 
conservation with the human homolog the 3 1 end is near 
position 453 in this sequence" 
/product="telomerase RNA component" 
60. . 68 

/note="The template region of the RNA specifies the 
synthesis of the telomere sequence TTAGGG" 
90 a 183 c 188 g 99 t 



Query Match 6.6%; 
Best Local Similarity 68.8%; 
Matches 256; Conservative 



Score 65; DB 105; Length 560; 
Pred. No. 1.01e-32; 
0; Mismatches 101; Indels 15; 



Gaps 12; 



Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



95 cttttgttctccgcccgctgtttttctcgctgacttccagcgggccaggaaagtccagac 

I I II I I II IN! I I I I I I II I I I I I I I I I I I I I I I II I I I I I I I I I I I I 
344 CTTTTGCTCCCCGCGCGCTGTTTTTCTCGCTGACTTTCAGCGGGCG-GAAAAGCCTCGGC 



155 



403 



ctgcagcgggccacccggcgttcccgagcct-caaaaacaaacgtcagcgcaggagctcc 

I I I I I I I I M I I III I I I I I 1 M I I I III Ml 
CTGCCGCCTTCCACCGTTCATTCTAGAGCAAACAAAAAATGTCAGCTGCTGGCCCGTTCG 



154 



402 



213 



4 62 



214 aggttcgccgggagctccgcggcgccgggccgcccagtcccgtaccc-gcctacaggccg 272 

I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I 

4 63 CCCCTCCCGGGGACCTGCGGCGGGTCGCCTGCCCAGCCCCCGAACCCCGCCTGGAGGCCG 522 

273 cggccggcctgggg—tcttagga — ctccg-ctgccgccgcgaagagctccgcctctgt 327 

- Ml I I I M I I I I Ml III I II MMI I I I I I I I I M I II I I I I I 
523 CGGTCGGCCCGGGGCTTCTCCGGAGGCACCCACTGCCACCGCGAAGAGTTGGGC-TCTGT 581 

328 cagccgcggg-cgcgcgggggctggggccaggc-cgggcgag-c--gccgcgaggacagg 382 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I Mill I I I I I 



9 



Qy 582 CAGCCGCGGGTCTCTCGGGGGCGAGGGCGAGGTTCAGGCCTTTCAGGCCGCAGGAAGAGG 641 



Db 383 aatggaactggtccccgtgttcggtgtc-ttacctgagctgtgggaagtgcacccggaac 441 

Mill! I I I I I I I I IN I I I I I I I I I I I I I I I I I I I I I II I I I I I 
Qy 642 AACGGAGCGAGTCCCCGCGCGCGGCGCGATTCCCTGAGCTGTGGGACGTGCACCCAGGAC 701 

Db 442 tcggttctcaca 453 

I I I I II I I I I 
Qy 702 TCGGCTCACACA 713 



RESULT 7 

LOCUS 128278 215 bp DNA PAT 30-OCT-1996 

DEFINITION Sequence 5 from patent US 5569830. 

ACCESSION 128278 

NID gl819054 

KEYWORDS 

SOURCE Unknown. 
ORGANISM Unknown. 

Unclassified . 
REFERENCE 1 {bases 1 to 215) 

AUTHORS Bennett, A. , Labavitch, J.M. , Powell, A. and Stotz,H. 

TITLE Plant inhibitors of fungal polygalacturonases and their use to 

control fungal disease 
JOURNAL Patent: US 5569830-A 5 29-OCT-1996; 
FEATURES Location/Qualifiers 
source 1 . . 215 

/organism^ "unknown" 
BASE COUNT 15 a 8 c 25 g 26 t 141 others 

ORIGIN 

Query Match 3.9%; Score 38; DB 71; Length 215; 

Best Local Similarity 13.6%; Pred. No. 2.28e-10; 

Matches 24; Conservative 79; Mismatches 71; Indels 3; Gaps 3; 

Db 6 mssssvvsrtascndkakkdgnttsswttdccnrtwgvcdtdttyrvnndsghnkyssan 65 

:::::::: I : I : : : I I : : I I I : : I : : : : : : I : : : : : 
Qy 465 CCTCCCGGGGACCTGCGGCGGGTCGCCTGCCCAGCCCCCGAACCCCGCCTGGAGGCCGCG 524 

Db 66 ynyggnnvgaakthyythtnv-sgadsktvtdsynasgtsssnggtdgnrsgadsygssk 124 

Qy 525 GTCGGCCCGGGGCTTCTCCGGAGGCACCCACTGCCACC-GCGAAGAGTTGGGCTCTGTCA 583 

Db 125 tamtsrnrtgk-tannavdsrnmgdasvgsdkntkkhaknsadgkvgsknngdrnnr 180 

Qy 584 GCCGCGGGTCTCTCGGGGGCGAGGGCGAGGTTCAGGCCTTTCAGGCCGCAGGAAGAG 640 



8 



RESULT 
LOCUS 

DEFINITION 

ACCESSION 

NID 

KEYWORDS 
SOURCE 

ORGANISM 

REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
FEATURES 

source 

BASE COUNT 



128278 
Sequence 
128278 
gl819054 



215 bp DNA 
from patent US 5569830. 



PAT 



30-OCT-1996 



Unknown . 

Unknown . 

Unclassified . 

1 (bases 1 to 215) 

Bennett, A., Labavitch, J. M. , Powell, A. andStotz,H. 

Plant inhibitors of fungal polygalacturonases and their use 

control fungal disease 

Patent: US 5569830-A 5 29-OCT-1996; 

Location /Qualifiers 

1. .215 

/organism= "unknown" 
15 a 8c 25 g 26 t 141 others 



to 



10 



ORIGIN 



Query Match 3.6%; Score 35; DB 71; Length 215; 

Best Local Similarity 16.5%; Pred. No. 3.99e-08; 



Matches 


34; Conservative 77; Mismatches 93; Indels 2; Gaps 


Db 


10 


svvsrtascndkakkdgnttsswttdccnrtwgvcdtdttyrvnndsghnkyssanynyg 


69 


cp 




uoLLuri^ ill uunuo luLL 1 1 ^rt^o 1 \* 1 1 O^^-rtrll 1 1 UurtuUn^n^ l Ow^v^^-rtu i ^rtVj 


807 


Db 


70 


gnnvgaakthyythtnvsgadsktvtdsynasgtsssnggtdgnrsgadsygssktamts 


129 


Cp 


806 


: ( : : I : : | : : : : : : : I : : 1 1 : 1 : 1 t : : : 1 : t : 
TCAGGTTTGGGGGTTCACAAGCCCCCACTGCCGGCGAGGGGTGACGGATGCGCACGATCG 


747 


Db 


130 


rnrtgktannavdsrnm-gdasvgsdkntkkhaknsadgkvgsknngdrnnrygtgtksn 
: :| : :: : |:| :|:: :::|: : : ::|: I: : III : 
GCGTTCCCCCCACCAACAGGAAAGCGAACTGCAT-GTGTGAGCCGAGTCCTGGGTGCACG 


188 


Cp 


746 


688 


ub 


iay 


vsnncgggnkrdvssyannkccgssc 214 
: | : : : : : | : | : : I 




Cp 


687 


TCCCACAGCTCAGGGAATCGCGCCGC 662 





RESULT 9 
LOCUS 

DEFINITION 

ACCESSION 

NID 

KEYWORDS 
SOURCE 

ORGANISM 

REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

FEATURES 

source 

BASE COUNT 
ORIGIN 



131769 30 bp DNA PAT 20-DEC-1996 

Sequence 22 from patent US 5583016. 

131769 

gl822560 

Unknown . 

Unknown . 

Unclassified. 

1 (bases 1 to 30) 

Villeponteau, B . , Feng, J. , Funk,W. and Andrews, W.H. 
Mammalian telomerase 

Patent: US 5583016-A 22 10-DEC-1996; 
Location/Qualifiers 
1. .30 

/organism^ "unknown" 
13 a 6c 11 g 0 t 



Query Match 3.1%; 
Best Local Similarity 100.0%; 
Matches 30; Conservative 



Score 30; DB 72; Length 30; 
Pred. No. 1.45e-04; 
0; Mismatches 0; Indels 



0; Gaps 0; 



Db 1 gagaaaaacagcgcgcggggagcaaaagca 30 

I I I I I I I I I I II I I I II I I I II I I I I I I I I 
Cp 371 GAGAAAAACAGCGCGCGGGGAGCAAAAGCA 342 
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RESULT 
LOCUS 

DEFINITION 

ACCESSION 

NID 

KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

FEATURES 

source 



131770 
Sequence 
131770 
gl822561 



26 bp 
23 from patent 



DNA 

US 5583016. 



PAT 



20-DEC-1996 



26) 



Unknown . 
Unknown . 
Unclassified. 
1 (bases 1 to 
Villeponteau, B. 
Mammalian, telomerase 

Patent: US 5583016-A 23 10-DEC-1996; 
Location /Qualifiers 
1..26 



Feng, J., Funk,W. and Andrews, W.H. 
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/organism="un known" 
BASE COUNT 7a 3c 9g 7 t 

ORIGIN 



Query Match 2.7%; Score 26; DB 72; Length 26; 

Best Local Similarity 100.0%; Pred. No. 6.35e-02; 

Matches 26; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Db 1 gtttgctctagaatgaacggtggaag 26 

I I I I I I I M I I I I I I I I I I I I I I I I I 
Cp 435 GTTTGCTCTAGAATGAACGGTGGAAG 410 



RESULT 11 

ID HS366D1 standard; DNA; HTG; 54404 BP. 

AC Z97986; 

NI el056722 

DT 22-JUL-1997 (Rel. 52, Created) 

DT 22- JUL- 1 997 (Rel. 52, Last updated, Version 1) 

DE Human DNA sequence *** SEQUENCING IN PROGRESS *** from clone 366D1 

KW HTG; HTGS_PHASE1. 

OS Homo sapiens (human) 

OC Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; 

OC Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; 

OC Homo . 

RN [1] 

RP 1-54404 

RA Barlow K. ; 
RT 

RL Submitted { 20- JUL- 1 997 ) to the EMBL/GenBank/DDBJ databases. 

RL Wellcome Trust Genome Campus, Hinxton, Cambridgeshire, CB10 ISA, 

RL UK. E-mail enquires: humquery@sanger.ac.uk Clone requests: 

RL clone request© sange r . ac.uk 

CC IMPORTANT: This sequence is unfinished and does not necessarily 

CC represent the correct sequence. Work on the sequence is in progress 

CC and 

CC the release of this data is based on the understanding that the 

CC sequence may change as work continues. The sequence may be 

CC contaminated 

CC with foreign sequence from E.coli, yeast, vector, phage etc. 

CC Order of segments is not known; 800 n's separate segments. 

CC Unfinished sequence: c366Dl Contig_ID: 

CC Unfinished sequence: c366Dl Contig_ID: 

CC Unfinished sequence: c366Dl Contig_ID: 

CC Unfinished sequence: c366Dl Contig__ID: 

FH Key Location/Qualifiers 
FH 

FT source 1 . . 54404 

FT /organism="Homo sapiens" 

FT /clone="366Dl" 

FT /chromosome="16" 

SQ Sequence 54404 BP; 9401 A; 15916 C; 16477 G; 10164 T; 2446 other; 

Query Match 2.7%; Score 26; DB 5; Length 54404; 
Best Local Similarity 83.0%; Pred. No. 6.35e-02; 

Matches 39; Conservative 0; Mismatches 7; Indels 1; Gaps 1; 

Db 50565 cggggtggggcgggtgggcgggcgaccggccgc-ggtccccgcgagg 50610 

I I I I I I I I II I I I I I I I I I I I I I I I M I I I I I f I I I I I I 

Cp 511 CGGGGTTCGGGGGCTGGGCAGGCGACCCGCCGCAGGTCCCCGGGAGG 4 65 



00051 Length: 6814 bp 

00729 Length: 16382 bp 

00410 Length: 7092 bp 

01222 Length: 21716 bp 



RESULT 12 

LOCUS HS366D1 54404 bp DNA HTG 22-JUL-1997 

DEFINITION Human DNA sequence *** SEQUENCING IN PROGRESS *** from clone 366D1; 
HTGS phase 1. 
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ACCESSION 
NID 

KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



COMMENT 



Z97986 
g2274954 

HTG; HTGS_PHASE1. 
human . 

Homo sapiens 

Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; 
Vertebrata; Mammalia; Eutheria; Primates; Catarrhini; Hominidae; 
Homo . 

1 (bases 1 to 54404) 

Barlow, K . 

Direct Submission 

Submitted {20- JUL-1997) Wellcome Trust Genome Campus, Hinxton, 
Cambridgeshire, CB10 ISA, UK, E-mail enquires: 

humquery@sanger.ac.uk Clone requests: clonerequest@sanger.ac.uk 
IMPORTANT: This sequence is unfinished and does not necessarily 
represent the correct sequence. Work on the sequence is in progress 
and 

the release of this data is based on the understanding that the 
sequence may change as work continues. The sequence may be 
contaminated 

with foreign sequence from E.coli, yeast, vector, phage etc. Order 
of segments is not known; 800 n's separate segments. Unfinished 
sequence: c366Dl Contig_ID: 00051 Length: 6814 bp Unfinished 
sequence: c366Dl Contig_ID: 00729 Length: 16382 bp Unfinished 
C366D1 Contig_ID: 00410 
C366D1 Contig_ID: 01222 



sequence : 
sequence : 



FEATURES 

source 



BASE COUNT 
ORIGIN 



Length: 7092 bp Unfinished 

Length: 21716 bp. 

+ + * 



*** WARNING: Phase 1 High Throughput Genome Sequence *** 

* + * + * + 

* This sequence is unfinished. When sequencing is complete, 

* the sequence data presented in this record will be replaced 
*by a single finished sequence with the same accession number. 

Location/Qualifiers 
1. .54404 

/organism="Homo sapiens" 
/clone="366Dl" 
/ ch romos ome= " 16" 
9401 a 15916 c 16477 g 10164 t 2446 others 



Query Match 2.7%; 
Best Local Similarity 83.0%; 
Matches 39; Conservative 



Score 26; DB 128; Length 54404; 
Pred. No. 6.35e-02; 
0; Mismatches 7; Indels 1; 



Gaps 1 ; 



Db 50565 cggggtggggcgggtgggcgggcgaccggccgc-ggtccccgcgagg 50610 

I I I I I I II I I I I I I I I I I I I i I I I I I I I I I I I I I I I M I 
Cp 511 CGGGGTTCGGGGGCTGGGCAGGCGACCCGCCGCAGGTCCCCGGGAGG 4 65 



RESULT 
LOCUS 

DEFINITION 

ACCESSION 

NID 

KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 

COMMENT 
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E04076 565 bp RNA PAT 26-NOV-1996 

gDNA encoding envelope region of type C hepatitis virus. 
E04076 
g2172286 

JP 1992349885-A/l. 
Hepatitis C virus. 
Hepatitis C virus 

Viruses; ssRNA positive-strand viruses, no DNA stage; Flaviviridae ; 
Hepatitis C-like viruses. 
1 (bases 1 to 565) 

Tsutae,M., Kazuaki,C, Hiromitsu,K. andYataro,I. . 

NUCLEIC ACID FRAGMENT WITHIN ENVELOPE REGION OF HEPATITIS C VIRUS 

AND METHOD FOR DETECTING THE SAME 

Patent: JP 1992349885-A 1 04-DEC-1992; 

TEIJIN LTD 

OS Hepatitis C virus 
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FEATURES 

source 

BASE COUNT 
ORIGIN 



PN JP 1992349885-A/l 
PD 04-DEC-1992 
PF 29-MAY-1991 JP 1991152169 
PI MORI NAG A TSUTAE, CHAYAMA KAZUAKI , 
ICHIKAWA YATARO 

PC C12N15/10,C12Ql/68,C12Ql/70//C12N15/ll; 
CC strancledness : Single; 
CC topology: Linear; . 

Location/Qualifiers 
1. .565 

/organism="Hepatitis C virus" 
60 a 93 c 107 g 85 t 



KUMADA HIROMITSU, PI 



220 others 



Query Match 2.5%; Score 25; DB 68; Length 565; 

Best Local Similarity 20.3%; Pred. No. 2.68e-01; 

Matches 26; Conservative 49; Mismatches 53; Indels 0; Gaps 0; 

Db 199 rargyrrmggayryvatcmtgcwydyyccbgggtgyrybccytgygtycgsgarrryrry 258 

: i : I : : : : : : : : I I : : I I : j I I I I : : : : I : : : I : : : : : : 

Cp 726 AAAGCGAACTGCATGTGTGAGCCGAGTCCTGGGTGCACGTCCCACAGCTCAGGGAATCGC 667 

Db 259 nnytcbmgntgytgggyrgcgctbacycccacgstygcvrcyagrrayvbyavyvtyccc 318 

: I : : I I : : I : I : I I : I : I : : : : : I : : : t : t I 

Cp 666 GCCGCGCGCGGGGACTCGCTCCGTTCCTCTTCCTGCGGCCTGAAAGGCCTGAACCTCGCC 607 

Db 319 rcbrygvm 326 

Cp 606 CTCGCCCC 599 
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RESULT 
LOCUS 

DEFINITION 

ACCESSION 

NID 

KEYWORDS 



SEGMENT 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
MEDLINE 
COMMENT 

FEATURES 

source 



TATA signal 



mtron 



HUMA2PIG1 1658 bp DNA PRI 30-OCT-1994 

Human alpha-2-plasmin inhibitor gene, exon 1. 
M20781 J03830 
gl77879 

alpha-2-plasmin inhibitor; plasmin inhibitor; serine protease 
inhibitor; serpin. 
1 of 6 

Human placenta DNA, clones lambda-PI [ 1, 2 , 6] . 
Homo sapiens 

Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; 
Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
1 (bases 1 to 1658) 

Hirosawa,S., Nakamura,Y., Miura,0., Sumi,Y. andAoki,N. 
Organization of the human alpha 2-plasmin inhibitor gene [published 
erratum appears in Proc Natl Acad Sci U S A 1989 Mar; 86 ( 5) : 1612-13] 
Proc. Natl. Acad. Sci. U.S.A. 85 (18), 6836-6840 (1988) 
88320531 

Draft entry and computer-readable sequence for [1] kindly submitted 
by N.Aoki, 28-SEP-1988. 

Location/Qualifiers 

1. .1658 

/organism="Homo sapiens" 
/ t is sue_type= "placenta" 
/map="18pll.l-qll.2" 
1070. .1075 
/gene="PLI" 
/note="G00-120-301" 
1087. .1104 
/gene="PLI" 
/note="G00-120-301" 
/number=l 
1105. .>1658 
/gene="PLI" 
/note="G00-120-301 n 
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/number=l 

BASE COUNT 282 a 495 c 533 g 348 t 

ORIGIN 233 bp upstream of BamHI site. 



Query Match 2.5%; Score 25; DB 97; Length 1658; 

Best Local Similarity 69.8%; Pred. No. 2.68e-01; 

Matches 44; Conservative 0; Mismatches 19; Indels 0; Gaps 0; 

Db 633 gcaggactctggcctgtgagggtggggttctggcttttcatgccccctgatgagggtcag 692 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M 
Qy 587 GCGGGTCTCTCGGGGGCGAGGGCGAGGTTCAGGCCTTTCAGGCCGCAGGAAGAGGAACGG 64 6 ■ 

Db 693 age 695 
I I I 

Qy 647 AGC 64 9 



RESULT 15 
LUUUS 

DEFINITION 

ACCESSION 
NID 

KEYWORDS 

SOURCE 

ORGANISM 

REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
MEDLINE 
FEATURES 

source 

gene 

CDS 



CDS 



PSESDSAB11 3510 bp DNA BCT 04-JUN-1992 

Pseudomonas sp. (strain ATCC 19151) sdsA gene and 11 kd protein, 
complete cds's; sdsB gene, partial cds . 
M86744 



gl51550 

11 kd protein; 

factor . 

Pseudomonas sp , 
Pseudomonas sp. 
Eubacteria; 
1 (bases 1 
Davison, J . , 



alkyl sulfatase; positive transcriptional control 
DNA. 



Proteobacteria . 
to 3510) 

Brunei, F. , Phanopoulos, A. , Prozzi, D. and Terpstra,P. 
Cloning and sequencing of Pseudomonas genes detdeterming sodium 
dodecyl sulfate biodegradation 
Gene 114, 19-24 (1992) 
92267380 

Location/Qualifiers 
1. .3510 

/organism="Pseudomonas sp." 

complement (57 . . 977 ) 

/gene="sdsB" 

complement (57 . . 977 ) 

/gene="sdsB" 

/codon_start=l 

/function="activates sdsA transcription" 
/evidence=experimental 

/product="positive transcriptional control factor" 

/db__xref="PID:gl51551" 

/trans l_table=ll 

/translation="MNDLRQLRHFVALAEHGHFARAAEAVNLSQPALSRSIQALENGL 

GCRLLDRGPRQVSLTAHGRLVLEHARRLLDGDRALRSAVSQLDNLGSGELRLGAGPYP 

GARLVPRALGRFAGAH PGVRVQLAIDTWYSLHQRLLDDALELFVADVRELRDDPQLEV 

TPLRSWPGVIFCRPGHPLLGRRRHRLTAADLAAYPLAGTQVPAEVAQALGQLAESGQP 

LGIECDN FMALKALVAESDVLSMAPLDVVAEEIEAGRLALLELAPGLLSQRSAYGLVS 

RAGRTLS PAAEAMRGLI L DE DART P P AS AR " 

1276. .2862 

/gene="sdsA" 

/codon_start=l 

/function="degrades sulfate esters of long-chain primary 

alcohols (SDS) n 

/evidence=experimental 

/product="alkyl sulfatase" 

/db_xref="PID:gl51552" 

/trans l_table=ll 

/translation="MIEAPEGLIIVDTGESVDQSRKVLAEFRKISDKPIKAIVYTHFH 
PDHINGVKAFVSEEQVKSGEVRIYAQETLLDNVVTQGSLVGPILTMRSGYSFGVALSD 
EDKRDMNAGLGPLAHEGASTFIAPTDTFRDSLDTTIAGLKVQFLHVPSEAPDEIVLYL 
PDNRVLISAEVTQGPTLPNVHTLRGTKFRDPVVWVASLDKLRAFQADVMVPLHGQPVS 
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gene 
CDS 



BASE COUNT 

ADTCTM 



GREKVEEVLRMTRDAIAYIHDQTVRWMNKGLTPDELVEKVKLPPHLAGYTPYLREYYG 

TVKHSVRQIYQGYLGWFQGDPVDLDPIPPAEKARRLIALMGGRDKVLMAAGDAYLKGD 

WQWAAELSGYAI RVDHDDKLARDI KARS FRRLG YASMN I NWRNWYLMSAMELEGKLEG 

DVALEMSRRVRAAFLSPDMLKNLPARIFLQNWVTRIDPEKSGDVELALG FAFPDIDEA 

WTLEVRRGVAQLKSGIDPAVPLRLTLDKRYLDTVISGENSLLKGALLGDVKVDGNLLD 

IKTFLGCFDFEDAPIALTVR" 

1276. .2862 

/gene="sdsA" 

2955. .3200 

/note="putative" 

/codon_start=l 

/function= "unknown" 

/product="ll kd protein" 

/db_xref="PID:gl51553" 

/transl_table=ll 

/ trans la tion="MHLPSFLKGALAMFWLVALLNVIYPFGETLHRPLLLASALVLLA 
HVGEAALFNRKLRARPSPWLERIQVLLFGFLHLRGLR" 
579 a 1252 c 1150 g 529 t 



Query Match 2.5%; 
Best Local Similarity 69.7%; 
Matches 53; Conservative 



Score 25; DB 32; Length 3510; 
Pred. No. 2.68e-01; 
0; Mismatches 22; Indels 1; 



Gaps 1 ; 



Db 631 ccggcgaagcaccccagcgcccggggca-ccaggcgggcgccggggtaggggccggcacc 689 

I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! 
Cp 54 5 CCGGAGAAGCCCCGGGCCGACCGCGGCCTCCAGGCGGGGTTCGGGGGCTGGGCAGGCGAC 48 6 

Db 690 caggcgcagttcgccg 705 

I I It I I I II II I 
Cp 4 85 CCGCCGCAGGTCCCCG 470 



Search completed: Thu Feb 5 05:30:23 1998 
Job time : 1615 sees. 
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